Go top
Paper information

Clinical characteristics and prognostic factors for icu admission of patientswith covid-19: a retrospective study using machine learning and natural language processing

Jose L. Izquierdo, J. Ancochea, I. H. Medrano, J. Tello, A. Porras, M. Serrano, S. Lumbreras, C. del Río-Bermúdez, S. Marchesseau, I. Salcedo, A. Martínez, C. Maté, S. Collazo, J. Barea, M. Villamayor, A. Urda, C. de la Pinta, I. Zubizarreta, Y. González, S. Menke, Joan B. Soriano

Journal of Medical Internet Research Vol. 22, nº. 10, pp. e21801-1 - e21801-13

Summary:

Background

Many factors involved in the onset and clinical course of the ongoing COVID-19 pandemic are still unknown. Although big data analytics and artificial intelligence are widely used in the realms of health and medicine, researchers are only beginning to use these tools to explore the clinical characteristics and predictive factors of patients with COVID-19.

Objective

Our primary objectives are to describe the clinical characteristics and determine the factors that predict intensive care unit (ICU) admission of patients with COVID-19. Determining these factors using a well-defined population can increase our understanding of the real-world epidemiology of the disease.

Methods

We used a combination of classic epidemiological methods, natural language processing (NLP), and machine learning (for predictive modeling) to analyze the electronic health records (EHRs) of patients with COVID-19. We explored the unstructured free text in the EHRs within the Servicio de Salud de Castilla-La Mancha (SESCAM) Health Care Network (Castilla-La Mancha, Spain) from the entire population with available EHRs (1,364,924 patients) from January 1 to March 29, 2020. We extracted related clinical information regarding diagnosis, progression, and outcome for all COVID-19 cases.

Results

A total of 10,504 patients with a clinical or polymerase chain reaction–confirmed diagnosis of COVID-19 were identified; 5519 (52.5%) were male, with a mean age of 58.2 years (SD 19.7). Upon admission, the most common symptoms were cough, fever, and dyspnea; however, all three symptoms occurred in fewer than half of the cases. Overall, 6.1% (83/1353) of hospitalized patients required ICU admission. Using a machine-learning, data-driven algorithm, we identified that a combination of age, fever, and tachypnea was the most parsimonious predictor of ICU admission; patients younger than 56 years, without tachypnea, and temperature <39 degrees Celsius (or >39 ºC without respiratory crackles) were not admitted to the ICU. In contrast, patients with COVID-19 aged 40 to 79 years were likely to be admitted to the ICU if they had tachypnea and delayed their visit to the emergency department after being seen in primary care.

Conclusions

Our results show that a combination of easily obtainable clinical variables (age, fever, and tachypnea with or without respiratory crackles) predicts whether patients with COVID-19 will require ICU admission.


Layman's summary:

This article uses data from the clinical history of first wave covid patients in Castilla La-Mancha to find, using Artificial Intelligence, the factors that best determine whether a Covid patient will require admission to the ICU or not. Tachypnea (rapid breathing) turns out to be a key symptom.


Keywords: artificial intelligence; big data; COVID-19; electronic health records; tachypnea; SARS-CoV-2; predictive model


JCR Impact Factor and WoS quartile: 5,428 - Q1 (2020); 5,800 - Q1 (2023)

DOI reference: DOI icon https://doi.org/10.2196/21801

Published on paper: October 2020.

Published on-line: June 2020.



Citation:
Jose L. Izquierdo, J. Ancochea, I. H. Medrano, J. Tello, A. Porras, M. Serrano, S. Lumbreras, C. del Río-Bermúdez, S. Marchesseau, I. Salcedo, A. Martínez, C. Maté, S. Collazo, J. Barea, M. Villamayor, A. Urda, C. de la Pinta, I. Zubizarreta, Y. González, S. Menke, Joan B. Soriano, Clinical characteristics and prognostic factors for icu admission of patientswith covid-19: a retrospective study using machine learning and natural language processing. Journal of Medical Internet Research. Vol. 22, nº. 10, pp. e21801-1 - e21801-13, October 2020. [Online: June 2020]